Discovered Rule Filtering Using Information Retrieval Technique

نویسندگان

  • Yasuhiko Kitamura
  • Keunsik Park
  • Akira Iida
  • Shoji Tatsumi
چکیده

A data mining system can semi-automatically discover knowledge by mining a large volume of data, but the discovered knowledge is not always novel and interesting to the user. We propose a discovered rule filtering method to filter rules discovered by a data mining system and to produce ones that are novel and interesting to the user by using information retrieval technique. In the method, we rank discovered rules according to the result of information retrieval from the Internet. In this paper, we show the steps of discovered rule filtering by using a concrete example of clinical data mining and MEDLINE document retrieval. Preliminary results show that this method has merits in not only filtering discovered rules but also providing a new viewpoint to the rules to give a chance to invoke a new data mining process.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Micro View and Macro View Approaches to Discovered Rule Filtering

A data mining system tries to semi-automatically discover knowledge by mining a large volume of raw data, but the discovered knowledge is not always novel and may contain unreasonable facts. We try to develop a discovered rule filtering method to filter rules discovered by a data mining system to be novel and reasonable ones by using information retrieval technique. In this method, we rank disc...

متن کامل

Preliminary Evaluation of Discovered-Rule-Filtering Methods

Data mining systems semi-automatically discover knowledge by mining a large volume of data, but discovered knowledge is not always novel to users. We discuss a discovered rule filtering method to filter rules discovered by a data mining system into ones that are novel to the user by using information retrieval results from the Internet. We have two methods; the micro view and the macro view met...

متن کامل

Using a Spring Embedding Algorithm to Display Term Relationships from a Medical Concept Discovery System

Visualization can make explicit the subtle and interdependent relationships that exist among terms in a broad topical area. Such visualizations may help in refining retrieval or enhance the user’s understanding a complex area. We will demonstrate an application consisting of a term generation service, UMLS categorization of the terms, and a spring embedded visualization of the term set relation...

متن کامل

Relations between Terms Discovered by Association Rules

This paper presents an approach to automatic knowledge base construction used for automatic query expansion. We have designed a system that is able to gain domain knowledge from analysed texts. We have used a Data Mining technique called association rules. We show how this technique is able to discover knowledge about relations between terms. We study how these relations can be used to improve ...

متن کامل

A New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation

Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002